Load-aware Adaptive Failover for Middleware Systems with Passive Replication
Abstract
Supporting uninterrupted services for performance-sensitive distributed applications operating in resource-constrained environments is hard. It is even harder when the operating environment is dynamic and processor or process failures and system workload changes are common. Fault-tolerant middleware for these applications must assure high service availability and satisfactory response times for clients. Although passive replication is a promising fault tolerance strategy for resource-constrained systems, conventional passive replication solutions are nonadaptive and load-agnostic, which can cause post-recovery system overloads and significantly increase response times. This paper presents Fault-tolerant Load-aware and Adaptive middlewaRe (FLARe), which enhances conventional passive replication schemes in three ways. First, its client failover strategy is both load-aware and adaptive: failover targets are selected at runtime based on current CPU utilizations to maintain satisfactory response times and alleviate CPU overload after failure recovery, and they are proactively adjusted in response to failures, system load fluctuations, and resource availability. Second, its client redirection strategy handles resource overloads that stem from multiple failures and workload fluctuations. Third, FLARe enables effective dissemination of failover decisions and manages CPU utilizations transparently to clients. Empirical evaluations on a distributed testbed demonstrate how FLARe efficiently uses available system resources and maintains satisfactory response times for clients when recovering from failures.
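The load-aware, adaptive selection of failover targets described in the abstract can be illustrated with a minimal sketch. This is not FLARe's actual algorithm or API: the function name `rank_failover_targets`, the host names, and the utilization figures are all hypothetical, and the sketch assumes that each replica host reports a single CPU utilization value between 0 and 1.

```python
from typing import Dict, Iterable, List


def rank_failover_targets(
    cpu_utilization: Dict[str, float],
    failed_hosts: Iterable[str] = (),
) -> List[str]:
    """Return live replica hosts ordered from least to most loaded.

    Hypothetical helper for illustration only; not part of FLARe.
    """
    failed = set(failed_hosts)
    live = [host for host in cpu_utilization if host not in failed]
    # Sort by utilization, breaking ties by host name for determinism.
    return sorted(live, key=lambda host: (cpu_utilization[host], host))


# Hypothetical utilization samples (fraction of CPU in use per host).
utilization = {"host-a": 0.72, "host-b": 0.35, "host-c": 0.50}

# Load-aware step: the least-loaded host is the first failover target.
initial_targets = rank_failover_targets(utilization)

# Adaptive step: host-b fails and host-c becomes busier, so the
# ordering is recomputed from the updated measurements.
utilization["host-c"] = 0.85
updated_targets = rank_failover_targets(utilization, failed_hosts={"host-b"})

print(initial_targets)
print(updated_targets)
```

Recomputing the ordering whenever a failure or a load change is observed, rather than fixing it at deployment time, is the contrast with static failover lists that the abstract draws.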
Similar resources
Adaptive Middleware for Data Replication
Dynamically adaptive systems sense their environment and adjust themselves to accommodate changes in order to maximize performance. Depending on the type of change (e.g., modifications of the load, the type of workload, the available resources, the client distribution, etc.), different adjustments have to be made. Coordinating them is already difficult in a centralized system. Doing so in th...
Resource-Aware Deployment and Configuration of QoS-enabled Middleware
Ad hoc deployment and configuration (D&C) of fault-tolerance mechanisms (e.g., replica-host mapping and failover ordering of replicas) can lead to unacceptable response times, overloads, and low availability for soft real-time applications. This paper describes how our quality-of-service (QoS)-enabled middleware called DeCoRAM (Deployment and Configuration Reasoning via Analysis and Modeling) prov...
MDDPro: Model-Driven Dependability Provisioning in Enterprise Distributed Real-Time and Embedded Systems
Service oriented architecture (SOA) design principles are increasingly being adopted to develop distributed real-time and embedded (DRE) systems, such as avionics mission computing, due to the availability of real-time component middleware platforms. Traditional approaches to fault tolerance that rely on replication and recovery of a single server or a single host do not work in this paradigm s...
Optimizing Distributed System Performance via Adaptive Middleware Load Balancing
Load balancing middleware is used extensively to improve scalability and overall system throughput in distributed systems. Many load balancing middleware services are simplistic, however, since they are geared only for specific use cases and environments. These limitations make it hard to use the same load balancing service for anything other than the distributed application it was designed for ...
Proposal of Grid Monitoring System with Fault Tolerance
A Grid monitoring system is differentiated from a general monitoring system in that it must be scalable across wide-area networks, include a large number of heterogeneous resources, and be integrated with the other Grid middleware in terms of naming and security issues. Grid monitoring is the act of collecting information concerning the characteristics and status of resources of interest. The...